Dataset statistics
| Number of variables | 29 |
|---|---|
| Number of observations | 556205 |
| Missing cells | 452079 |
| Missing cells (%) | 2.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 123.1 MiB |
| Average record size in memory | 232.0 B |
Variable types
| Numeric | 12 |
|---|---|
| Unsupported | 1 |
| Categorical | 14 |
| DateTime | 2 |
CRASH TIME has a high cardinality: 1440 distinct values | High cardinality |
LOCATION has a high cardinality: 122234 distinct values | High cardinality |
CONTRIBUTING FACTOR VEHICLE 1 has a high cardinality: 55 distinct values | High cardinality |
VEHICLE TYPE CODE 1 has a high cardinality: 839 distinct values | High cardinality |
VEHICLE TYPE CODE 2 has a high cardinality: 890 distinct values | High cardinality |
df_index is highly correlated with COLLISION_ID and 1 other fields | High correlation |
NUMBER OF PERSONS INJURED is highly correlated with NUMBER OF MOTORIST INJURED | High correlation |
NUMBER OF PERSONS KILLED is highly correlated with NUMBER OF PEDESTRIANS KILLED and 1 other fields | High correlation |
NUMBER OF PEDESTRIANS KILLED is highly correlated with NUMBER OF PERSONS KILLED | High correlation |
NUMBER OF MOTORIST INJURED is highly correlated with NUMBER OF PERSONS INJURED | High correlation |
NUMBER OF MOTORIST KILLED is highly correlated with NUMBER OF PERSONS KILLED | High correlation |
COLLISION_ID is highly correlated with df_index and 1 other fields | High correlation |
Year is highly correlated with df_index and 1 other fields | High correlation |
DayOfWeekNumber is highly correlated with hourofweek | High correlation |
hourofday is highly correlated with timeofdaypercent | High correlation |
timeofdaypercent is highly correlated with hourofday | High correlation |
hourofweek is highly correlated with DayOfWeekNumber | High correlation |
df_index is highly correlated with COLLISION_ID and 1 other fields | High correlation |
LATITUDE is highly correlated with LONGITUDE | High correlation |
LONGITUDE is highly correlated with LATITUDE | High correlation |
NUMBER OF PERSONS INJURED is highly correlated with NUMBER OF MOTORIST INJURED | High correlation |
NUMBER OF PERSONS KILLED is highly correlated with NUMBER OF PEDESTRIANS KILLED and 1 other fields | High correlation |
NUMBER OF PEDESTRIANS KILLED is highly correlated with NUMBER OF PERSONS KILLED | High correlation |
NUMBER OF MOTORIST INJURED is highly correlated with NUMBER OF PERSONS INJURED | High correlation |
NUMBER OF MOTORIST KILLED is highly correlated with NUMBER OF PERSONS KILLED | High correlation |
COLLISION_ID is highly correlated with df_index and 1 other fields | High correlation |
Year is highly correlated with df_index and 1 other fields | High correlation |
DayOfWeekNumber is highly correlated with hourofweek | High correlation |
hourofday is highly correlated with timeofdaypercent | High correlation |
timeofdaypercent is highly correlated with hourofday | High correlation |
hourofweek is highly correlated with DayOfWeekNumber | High correlation |
df_index is highly correlated with COLLISION_ID and 1 other fields | High correlation |
NUMBER OF PERSONS INJURED is highly correlated with NUMBER OF MOTORIST INJURED | High correlation |
NUMBER OF PERSONS KILLED is highly correlated with NUMBER OF PEDESTRIANS KILLED and 1 other fields | High correlation |
NUMBER OF PEDESTRIANS KILLED is highly correlated with NUMBER OF PERSONS KILLED | High correlation |
NUMBER OF MOTORIST INJURED is highly correlated with NUMBER OF PERSONS INJURED | High correlation |
NUMBER OF MOTORIST KILLED is highly correlated with NUMBER OF PERSONS KILLED | High correlation |
COLLISION_ID is highly correlated with df_index and 1 other fields | High correlation |
Year is highly correlated with df_index and 1 other fields | High correlation |
DayOfWeekNumber is highly correlated with hourofweek | High correlation |
hourofday is highly correlated with timeofdaypercent | High correlation |
timeofdaypercent is highly correlated with hourofday | High correlation |
hourofweek is highly correlated with DayOfWeekNumber | High correlation |
NUMBER OF PERSONS KILLED is highly correlated with NUMBER OF MOTORIST KILLED and 1 other fields | High correlation |
NUMBER OF MOTORIST KILLED is highly correlated with NUMBER OF PERSONS KILLED | High correlation |
NUMBER OF PEDESTRIANS KILLED is highly correlated with NUMBER OF PERSONS KILLED | High correlation |
df_index is highly correlated with COLLISION_ID and 2 other fields | High correlation |
LATITUDE is highly correlated with LONGITUDE | High correlation |
LONGITUDE is highly correlated with LATITUDE | High correlation |
NUMBER OF PERSONS INJURED is highly correlated with NUMBER OF MOTORIST INJURED | High correlation |
NUMBER OF PERSONS KILLED is highly correlated with NUMBER OF PEDESTRIANS KILLED and 1 other fields | High correlation |
NUMBER OF PEDESTRIANS KILLED is highly correlated with NUMBER OF PERSONS KILLED | High correlation |
NUMBER OF MOTORIST INJURED is highly correlated with NUMBER OF PERSONS INJURED | High correlation |
NUMBER OF MOTORIST KILLED is highly correlated with NUMBER OF PERSONS KILLED | High correlation |
COLLISION_ID is highly correlated with df_index and 2 other fields | High correlation |
Year is highly correlated with df_index and 1 other fields | High correlation |
DayOfWeekNumber is highly correlated with DayOfWeek and 1 other fields | High correlation |
DayOfWeek is highly correlated with DayOfWeekNumber and 1 other fields | High correlation |
hourofday is highly correlated with timeofdaypercent and 1 other fields | High correlation |
timeofdaypercent is highly correlated with hourofday and 1 other fields | High correlation |
month is highly correlated with df_index and 1 other fields | High correlation |
hourofweek is highly correlated with DayOfWeekNumber and 3 other fields | High correlation |
BOROUGH has 196099 (35.3%) missing values | Missing |
LATITUDE has 41428 (7.4%) missing values | Missing |
LONGITUDE has 41428 (7.4%) missing values | Missing |
LOCATION has 41428 (7.4%) missing values | Missing |
VEHICLE TYPE CODE 2 has 125877 (22.6%) missing values | Missing |
LATITUDE is highly skewed (γ1 = -26.02064086) | Skewed |
df_index has unique values | Unique |
COLLISION_ID has unique values | Unique |
CRASH DATE is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
NUMBER OF PERSONS INJURED has 432082 (77.7%) zeros | Zeros |
NUMBER OF PEDESTRIANS INJURED has 528873 (95.1%) zeros | Zeros |
NUMBER OF MOTORIST INJURED has 473844 (85.2%) zeros | Zeros |
DayOfWeekNumber has 79914 (14.4%) zeros | Zeros |
hourofday has 20154 (3.6%) zeros | Zeros |
minute has 111720 (20.1%) zeros | Zeros |
timeofdaypercent has 8482 (1.5%) zeros | Zeros |
Reproduction
| Analysis started | 2022-05-08 15:36:54.881787 |
|---|---|
| Analysis finished | 2022-05-08 15:40:00.949927 |
| Duration | 3 minutes and 6.07 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
df_index
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIQUE| Distinct | 556205 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 397820.5303 |
| Minimum | 13 |
|---|---|
| Maximum | 688757 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 103777.2 |
| Q1 | 262458 |
| median | 410423 |
| Q3 | 549474 |
| 95-th percentile | 660789.8 |
| Maximum | 688757 |
| Range | 688744 |
| Interquartile range (IQR) | 287016 |
Descriptive statistics
| Standard deviation | 178624.1948 |
|---|---|
| Coefficient of variation (CV) | 0.4490069797 |
| Kurtosis | -1.040195417 |
| Mean | 397820.5303 |
| Median Absolute Deviation (MAD) | 143508 |
| Skewness | -0.2198333476 |
| Sum | 2.21269768 × 1011 |
| Variance | 3.190660295 × 1010 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 8196 | 1 | < 0.1% |
| 435073 | 1 | < 0.1% |
| 447375 | 1 | < 0.1% |
| 449422 | 1 | < 0.1% |
| 443277 | 1 | < 0.1% |
| 445324 | 1 | < 0.1% |
| 455563 | 1 | < 0.1% |
| 457610 | 1 | < 0.1% |
| 451465 | 1 | < 0.1% |
| 453512 | 1 | < 0.1% |
| Other values (556195) | 556195 |
| Value | Count | Frequency (%) |
| 13 | 1 | |
| 14 | 1 | |
| 39 | 1 | |
| 93 | 1 | |
| 423 | 1 | |
| 662 | 1 | |
| 690 | 1 | |
| 847 | 1 | |
| 1044 | 1 | |
| 1145 | 1 |
| Value | Count | Frequency (%) |
| 688757 | 1 | |
| 688756 | 1 | |
| 688755 | 1 | |
| 688754 | 1 | |
| 688753 | 1 | |
| 688752 | 1 | |
| 688751 | 1 | |
| 688750 | 1 | |
| 688749 | 1 | |
| 688748 | 1 |
| Distinct | 1440 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| 0:00 | 8482 |
|---|---|
| 16:00 | 7813 |
| 17:00 | 7558 |
| 15:00 | 7323 |
| 14:00 | 7230 |
| Other values (1435) |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.734871136 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 22:50 |
|---|---|
| 2nd row | 15:49 |
| 3rd row | 16:30 |
| 4th row | 20:19 |
| 5th row | 15:20 |
Common Values
| Value | Count | Frequency (%) |
| 0:00 | 8482 | 1.5% |
| 16:00 | 7813 | 1.4% |
| 17:00 | 7558 | 1.4% |
| 15:00 | 7323 | 1.3% |
| 14:00 | 7230 | 1.3% |
| 18:00 | 6938 | 1.2% |
| 13:00 | 6655 | 1.2% |
| 12:00 | 6122 | 1.1% |
| 9:00 | 6021 | 1.1% |
| 8:00 | 5729 | 1.0% |
| Other values (1430) | 486334 |
Length
| Value | Count | Frequency (%) |
| 0:00 | 8482 | 1.5% |
| 16:00 | 7813 | 1.4% |
| 17:00 | 7558 | 1.4% |
| 15:00 | 7323 | 1.3% |
| 14:00 | 7230 | 1.3% |
| 18:00 | 6938 | 1.2% |
| 13:00 | 6655 | 1.2% |
| 12:00 | 6122 | 1.1% |
| 9:00 | 6021 | 1.1% |
| 8:00 | 5729 | 1.0% |
| Other values (1430) | 486334 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 196099 |
| Missing (%) | 35.3% |
| Memory size | 4.2 MiB |
| BROOKLYN | |
|---|---|
| QUEENS | |
| MANHATTAN | |
| BRONX | |
| STATEN ISLAND |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 7.296820936 |
| Min length | 5 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BROOKLYN |
|---|---|
| 2nd row | MANHATTAN |
| 3rd row | QUEENS |
| 4th row | MANHATTAN |
| 5th row | MANHATTAN |
Common Values
| Value | Count | Frequency (%) |
| BROOKLYN | 117330 | |
| QUEENS | 101849 | |
| MANHATTAN | 68881 | 12.4% |
| BRONX | 59829 | 10.8% |
| STATEN ISLAND | 12217 | 2.2% |
| (Missing) | 196099 |
Length
Pie chart
| Value | Count | Frequency (%) |
| brooklyn | 117330 | |
| queens | 101849 | |
| manhattan | 68881 | |
| bronx | 59829 | |
| staten | 12217 | 3.3% |
| island | 12217 | 3.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 62407 |
|---|---|
| Distinct (%) | 12.1% |
| Missing | 41428 |
| Missing (%) | 7.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.66615177 |
| Minimum | 0 |
|---|---|
| Maximum | 41.12421 |
| Zeros | 751 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 40.598628 |
| Q1 | 40.668224 |
| median | 40.72028 |
| Q3 | 40.77434 |
| 95-th percentile | 40.862823 |
| Maximum | 41.12421 |
| Range | 41.12421 |
| Interquartile range (IQR) | 0.106116 |
Descriptive statistics
| Standard deviation | 1.556441398 |
|---|---|
| Coefficient of variation (CV) | 0.03827363373 |
| Kurtosis | 676.8732272 |
| Mean | 40.66615177 |
| Median Absolute Deviation (MAD) | 0.0527 |
| Skewness | -26.02064086 |
| Sum | 20933999.61 |
| Variance | 2.422509826 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 751 | 0.1% |
| 40.861862 | 431 | 0.1% |
| 40.696033 | 375 | 0.1% |
| 40.8047 | 331 | 0.1% |
| 40.798256 | 293 | 0.1% |
| 40.608757 | 281 | 0.1% |
| 40.75898 | 247 | < 0.1% |
| 40.733536 | 246 | < 0.1% |
| 40.820305 | 242 | < 0.1% |
| 40.76229 | 240 | < 0.1% |
| Other values (62397) | 511340 | |
| (Missing) | 41428 | 7.4% |
| Value | Count | Frequency (%) |
| 0 | 751 | |
| 40.49984 | 1 | < 0.1% |
| 40.500023 | 1 | < 0.1% |
| 40.500084 | 1 | < 0.1% |
| 40.501465 | 1 | < 0.1% |
| 40.50163 | 1 | < 0.1% |
| 40.501987 | 1 | < 0.1% |
| 40.502182 | 1 | < 0.1% |
| 40.50234 | 1 | < 0.1% |
| 40.502396 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 41.12421 | 1 | < 0.1% |
| 40.912884 | 6 | |
| 40.912468 | 9 | |
| 40.91222 | 1 | < 0.1% |
| 40.91217 | 1 | < 0.1% |
| 40.912117 | 1 | < 0.1% |
| 40.91208 | 6 | |
| 40.91206 | 1 | < 0.1% |
| 40.912018 | 2 | < 0.1% |
| 40.911667 | 1 | < 0.1% |
| Distinct | 41779 |
|---|---|
| Distinct (%) | 8.1% |
| Missing | 41428 |
| Missing (%) | 7.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.8168721 |
| Minimum | -201.23706 |
|---|---|
| Maximum | 0 |
| Zeros | 751 |
| Zeros (%) | 0.1% |
| Negative | 514026 |
| Negative (%) | 92.4% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | -201.23706 |
|---|---|
| 5-th percentile | -74.02453 |
| Q1 | -73.96899 |
| median | -73.92147 |
| Q3 | -73.86239 |
| 95-th percentile | -73.76111 |
| Maximum | 0 |
| Range | 201.23706 |
| Interquartile range (IQR) | 0.1066 |
Descriptive statistics
| Standard deviation | 3.023231539 |
|---|---|
| Coefficient of variation (CV) | -0.04095583371 |
| Kurtosis | 742.4564494 |
| Mean | -73.8168721 |
| Median Absolute Deviation (MAD) | 0.052834 |
| Skewness | 15.86448813 |
| Sum | -37999227.97 |
| Variance | 9.139928939 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 751 | 0.1% |
| -73.91282 | 447 | 0.1% |
| -73.98453 | 396 | 0.1% |
| -73.91243 | 339 | 0.1% |
| -73.89063 | 333 | 0.1% |
| -73.882744 | 310 | 0.1% |
| -73.89686 | 291 | 0.1% |
| -73.89083 | 284 | 0.1% |
| -74.038086 | 281 | 0.1% |
| -73.91727 | 267 | < 0.1% |
| Other values (41769) | 511078 | |
| (Missing) | 41428 | 7.4% |
| Value | Count | Frequency (%) |
| -201.23706 | 37 | |
| -74.742 | 3 | < 0.1% |
| -74.25393 | 1 | < 0.1% |
| -74.253006 | 1 | < 0.1% |
| -74.252884 | 1 | < 0.1% |
| -74.25218 | 1 | < 0.1% |
| -74.25209 | 1 | < 0.1% |
| -74.25188 | 1 | < 0.1% |
| -74.25184 | 2 | < 0.1% |
| -74.251495 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 751 | |
| -32.768513 | 2 | < 0.1% |
| -73.70055 | 2 | < 0.1% |
| -73.700584 | 5 | < 0.1% |
| -73.70071 | 2 | < 0.1% |
| -73.70073 | 1 | < 0.1% |
| -73.70076 | 1 | < 0.1% |
| -73.70084 | 2 | < 0.1% |
| -73.70099 | 14 | < 0.1% |
| -73.701004 | 1 | < 0.1% |
| Distinct | 122234 |
|---|---|
| Distinct (%) | 23.7% |
| Missing | 41428 |
| Missing (%) | 7.4% |
| Memory size | 4.2 MiB |
| (0.0, 0.0) | 751 |
|---|---|
| (40.861862, -73.91282) | 429 |
| (40.696033, -73.98453) | 375 |
| (40.8047, -73.91243) | 312 |
| (40.608757, -74.038086) | 281 |
| Other values (122229) |
Length
| Max length | 25 |
|---|---|
| Median length | 22 |
| Mean length | 21.71638982 |
| Min length | 10 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 70434 ? |
|---|---|
| Unique (%) | 13.7% |
Sample
| 1st row | (40.69754, -73.98312) |
|---|---|
| 2nd row | (40.671585, -73.99843) |
| 3rd row | (40.651974, -73.86542) |
| 4th row | (40.77161, -73.99046) |
| 5th row | (40.771038, -73.83413) |
Common Values
| Value | Count | Frequency (%) |
| (0.0, 0.0) | 751 | 0.1% |
| (40.861862, -73.91282) | 429 | 0.1% |
| (40.696033, -73.98453) | 375 | 0.1% |
| (40.8047, -73.91243) | 312 | 0.1% |
| (40.608757, -74.038086) | 281 | 0.1% |
| (40.798256, -73.82744) | 257 | < 0.1% |
| (40.733536, -73.87035) | 246 | < 0.1% |
| (40.820305, -73.89083) | 242 | < 0.1% |
| (40.675735, -73.89686) | 236 | < 0.1% |
| (40.83801, -73.87329) | 233 | < 0.1% |
| Other values (122224) | 511415 | |
| (Missing) | 41428 | 7.4% |
Length
| Value | Count | Frequency (%) |
| 0.0 | 1502 | 0.1% |
| 73.91282 | 447 | < 0.1% |
| 40.861862 | 431 | < 0.1% |
| 73.98453 | 396 | < 0.1% |
| 40.696033 | 375 | < 0.1% |
| 73.91243 | 339 | < 0.1% |
| 73.89063 | 333 | < 0.1% |
| 40.8047 | 331 | < 0.1% |
| 73.882744 | 310 | < 0.1% |
| 40.798256 | 293 | < 0.1% |
| Other values (104175) | 1024797 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
NUMBER OF PERSONS INJURED
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.302121539 |
| Minimum | 0 |
|---|---|
| Maximum | 22 |
| Zeros | 432082 |
| Zeros (%) | 77.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 22 |
| Range | 22 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.6849383441 |
|---|---|
| Coefficient of variation (CV) | 2.267095376 |
| Kurtosis | 28.30558787 |
| Mean | 0.302121539 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.773722175 |
| Sum | 168040 |
| Variance | 0.4691405353 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 432082 | |
| 1 | 96100 | 17.3% |
| 2 | 18409 | 3.3% |
| 3 | 5965 | 1.1% |
| 4 | 2219 | 0.4% |
| 5 | 837 | 0.2% |
| 6 | 319 | 0.1% |
| 7 | 136 | < 0.1% |
| 8 | 61 | < 0.1% |
| 9 | 25 | < 0.1% |
| Other values (11) | 47 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 432082 | |
| 1 | 96100 | 17.3% |
| 2 | 18409 | 3.3% |
| 3 | 5965 | 1.1% |
| 4 | 2219 | 0.4% |
| 5 | 837 | 0.2% |
| 6 | 319 | 0.1% |
| 7 | 136 | < 0.1% |
| 8 | 61 | < 0.1% |
| 9 | 25 | < 0.1% |
| Value | Count | Frequency (%) |
| 22 | 1 | < 0.1% |
| 20 | 2 | < 0.1% |
| 18 | 1 | < 0.1% |
| 17 | 3 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 2 | < 0.1% |
| 14 | 2 | < 0.1% |
| 13 | 3 | < 0.1% |
| 12 | 7 | |
| 11 | 9 |
NUMBER OF PERSONS KILLED
Categorical
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 14 |
| Missing (%) | < 0.1% |
| Memory size | 4.2 MiB |
| 0.0 | |
|---|---|
| 1.0 | 700 |
| 2.0 | 18 |
| 3.0 | 2 |
| 4.0 | 1 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 555470 | |
| 1.0 | 700 | 0.1% |
| 2.0 | 18 | < 0.1% |
| 3.0 | 2 | < 0.1% |
| 4.0 | 1 | < 0.1% |
| (Missing) | 14 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0.0 | 555470 | |
| 1.0 | 700 | 0.1% |
| 2.0 | 18 | < 0.1% |
| 3.0 | 2 | < 0.1% |
| 4.0 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0510621084 |
| Minimum | 0 |
|---|---|
| Maximum | 7 |
| Zeros | 528873 |
| Zeros (%) | 95.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2302640362 |
|---|---|
| Coefficient of variation (CV) | 4.509489394 |
| Kurtosis | 30.62087674 |
| Mean | 0.0510621084 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.886336249 |
| Sum | 28401 |
| Variance | 0.05302152638 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 528873 | |
| 1 | 26403 | 4.7% |
| 2 | 828 | 0.1% |
| 3 | 77 | < 0.1% |
| 4 | 15 | < 0.1% |
| 5 | 4 | < 0.1% |
| 6 | 4 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 528873 | |
| 1 | 26403 | 4.7% |
| 2 | 828 | 0.1% |
| 3 | 77 | < 0.1% |
| 4 | 15 | < 0.1% |
| 5 | 4 | < 0.1% |
| 6 | 4 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7 | 1 | < 0.1% |
| 6 | 4 | < 0.1% |
| 5 | 4 | < 0.1% |
| 4 | 15 | < 0.1% |
| 3 | 77 | < 0.1% |
| 2 | 828 | 0.1% |
| 1 | 26403 | 4.7% |
| 0 | 528873 |
NUMBER OF PEDESTRIANS KILLED
Categorical
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| 0 | |
|---|---|
| 1 | 350 |
| 2 | 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 555852 | |
| 1 | 350 | 0.1% |
| 2 | 3 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 555852 | |
| 1 | 350 | 0.1% |
| 2 | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
NUMBER OF CYCLIST INJURED
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| 0 | |
|---|---|
| 1 | 14866 |
| 2 | 208 |
| 3 | 4 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 541127 | |
| 1 | 14866 | 2.7% |
| 2 | 208 | < 0.1% |
| 3 | 4 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 541127 | |
| 1 | 14866 | 2.7% |
| 2 | 208 | < 0.1% |
| 3 | 4 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
NUMBER OF CYCLIST KILLED
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| 0 | |
|---|---|
| 1 | 70 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 556135 | |
| 1 | 70 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 556135 | |
| 1 | 70 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
NUMBER OF MOTORIST INJURED
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2235165092 |
| Minimum | 0 |
|---|---|
| Maximum | 21 |
| Zeros | 473844 |
| Zeros (%) | 85.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 21 |
| Range | 21 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.6500539542 |
|---|---|
| Coefficient of variation (CV) | 2.908303984 |
| Kurtosis | 36.48045492 |
| Mean | 0.2235165092 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.552156025 |
| Sum | 124321 |
| Variance | 0.4225701434 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 473844 | |
| 1 | 55963 | 10.1% |
| 2 | 16998 | 3.1% |
| 3 | 5821 | 1.0% |
| 4 | 2187 | 0.4% |
| 5 | 822 | 0.1% |
| 6 | 312 | 0.1% |
| 7 | 129 | < 0.1% |
| 8 | 57 | < 0.1% |
| 9 | 25 | < 0.1% |
| Other values (11) | 47 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 473844 | |
| 1 | 55963 | 10.1% |
| 2 | 16998 | 3.1% |
| 3 | 5821 | 1.0% |
| 4 | 2187 | 0.4% |
| 5 | 822 | 0.1% |
| 6 | 312 | 0.1% |
| 7 | 129 | < 0.1% |
| 8 | 57 | < 0.1% |
| 9 | 25 | < 0.1% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 20 | 2 | < 0.1% |
| 18 | 1 | < 0.1% |
| 17 | 3 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 2 | < 0.1% |
| 14 | 2 | < 0.1% |
| 13 | 3 | < 0.1% |
| 12 | 6 | |
| 11 | 10 |
NUMBER OF MOTORIST KILLED
Categorical
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| 0 | |
|---|---|
| 1 | 284 |
| 2 | 13 |
| 3 | 2 |
| 4 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 555905 | |
| 1 | 284 | 0.1% |
| 2 | 13 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 555905 | |
| 1 | 284 | 0.1% |
| 2 | 13 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 55 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1942 |
| Missing (%) | 0.3% |
| Memory size | 4.2 MiB |
| Driver Inattention/Distraction | |
|---|---|
| Unspecified | |
| Following Too Closely | |
| Failure to Yield Right-of-Way | |
| Backing Unsafely | |
| Other values (50) |
Length
| Max length | 53 |
|---|---|
| Median length | 21 |
| Mean length | 21.45328842 |
| Min length | 5 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Passing or Lane Usage Improper |
|---|---|
| 2nd row | Driver Inattention/Distraction |
| 3rd row | Reaction to Uninvolved Vehicle |
| 4th row | Following Too Closely |
| 5th row | Driver Inattention/Distraction |
Common Values
| Value | Count | Frequency (%) |
| Driver Inattention/Distraction | 140948 | |
| Unspecified | 128683 | |
| Following Too Closely | 48977 | 8.8% |
| Failure to Yield Right-of-Way | 38325 | 6.9% |
| Backing Unsafely | 24912 | 4.5% |
| Passing or Lane Usage Improper | 24084 | 4.3% |
| Passing Too Closely | 22589 | 4.1% |
| Unsafe Lane Changing | 17600 | 3.2% |
| Other Vehicular | 16500 | 3.0% |
| Turning Improperly | 12422 | 2.2% |
| Other values (45) | 79223 |
Length
| Value | Count | Frequency (%) |
| driver | 149719 | 12.0% |
| inattention/distraction | 140948 | 11.3% |
| unspecified | 128683 | 10.3% |
| too | 71566 | 5.7% |
| closely | 71566 | 5.7% |
| following | 48977 | 3.9% |
| to | 48003 | 3.8% |
| passing | 46673 | 3.7% |
| lane | 41984 | 3.4% |
| failure | 39856 | 3.2% |
| Other values (93) | 464434 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
COLLISION_ID
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIQUE| Distinct | 556205 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4102031.994 |
| Minimum | 3511951 |
|---|---|
| Maximum | 4513071 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 3511951 |
|---|---|
| 5-th percentile | 3849157.2 |
| Q1 | 3960658 |
| median | 4103330 |
| Q3 | 4242467 |
| 95-th percentile | 4353752.8 |
| Maximum | 4513071 |
| Range | 1001120 |
| Interquartile range (IQR) | 281809 |
Descriptive statistics
| Standard deviation | 162142.2421 |
|---|---|
| Coefficient of variation (CV) | 0.03952729826 |
| Kurtosis | -1.204276867 |
| Mean | 4102031.994 |
| Median Absolute Deviation (MAD) | 140900 |
| Skewness | -0.0090812053 |
| Sum | 2.281570705 × 1012 |
| Variance | 2.629010667 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4194304 | 1 | < 0.1% |
| 3895867 | 1 | < 0.1% |
| 4121225 | 1 | < 0.1% |
| 4123272 | 1 | < 0.1% |
| 4100743 | 1 | < 0.1% |
| 4102790 | 1 | < 0.1% |
| 4096645 | 1 | < 0.1% |
| 4098692 | 1 | < 0.1% |
| 4108931 | 1 | < 0.1% |
| 4110978 | 1 | < 0.1% |
| Other values (556195) | 556195 |
| Value | Count | Frequency (%) |
| 3511951 | 1 | |
| 3590187 | 1 | |
| 3591031 | 1 | |
| 3600567 | 1 | |
| 3818619 | 1 | |
| 3818620 | 1 | |
| 3818625 | 1 | |
| 3818637 | 1 | |
| 3818640 | 1 | |
| 3818641 | 1 |
| Value | Count | Frequency (%) |
| 4513071 | 1 | |
| 4512076 | 1 | |
| 4511910 | 1 | |
| 4511268 | 1 | |
| 4498730 | 1 | |
| 4498013 | 1 | |
| 4493620 | 1 | |
| 4491746 | 1 | |
| 4485338 | 1 | |
| 4483137 | 1 |
| Distinct | 839 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 3858 |
| Missing (%) | 0.7% |
| Memory size | 4.2 MiB |
| Sedan | |
|---|---|
| Station Wagon/Sport Utility Vehicle | |
| Taxi | 25281 |
| Pick-up Truck | 16057 |
| Box Truck | 11036 |
| Other values (834) |
Length
| Max length | 38 |
|---|---|
| Median length | 5 |
| Mean length | 16.53787384 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 500 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | �MBU |
|---|---|
| 2nd row | Sedan |
| 3rd row | Sedan |
| 4th row | Sedan |
| 5th row | Sedan |
Common Values
| Value | Count | Frequency (%) |
| Sedan | 254180 | |
| Station Wagon/Sport Utility Vehicle | 203500 | |
| Taxi | 25281 | 4.5% |
| Pick-up Truck | 16057 | 2.9% |
| Box Truck | 11036 | 2.0% |
| Bus | 8668 | 1.6% |
| Bike | 5675 | 1.0% |
| Tractor Truck Diesel | 4424 | 0.8% |
| Van | 3513 | 0.6% |
| Motorcycle | 3044 | 0.5% |
| Other values (829) | 16969 | 3.1% |
| (Missing) | 3858 | 0.7% |
Length
| Value | Count | Frequency (%) |
| sedan | 254521 | |
| vehicle | 203543 | |
| utility | 203511 | |
| station | 203500 | |
| wagon/sport | 203500 | |
| truck | 33241 | 2.8% |
| taxi | 25281 | 2.1% |
| pick-up | 16058 | 1.3% |
| box | 11085 | 0.9% |
| bus | 8769 | 0.7% |
| Other values (538) | 45407 | 3.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 890 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 125877 |
| Missing (%) | 22.6% |
| Memory size | 4.2 MiB |
| Sedan | |
|---|---|
| Station Wagon/Sport Utility Vehicle | |
| Taxi | |
| Pick-up Truck | 14931 |
| Box Truck | 12418 |
| Other values (885) |
Length
| Max length | 38 |
|---|---|
| Median length | 5 |
| Mean length | 16.25627428 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 531 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Taxi |
|---|---|
| 2nd row | Station Wagon/Sport Utility Vehicle |
| 3rd row | Sedan |
| 4th row | Tractor Truck Diesel |
| 5th row | Sedan |
Common Values
| Value | Count | Frequency (%) |
| Sedan | 184957 | |
| Station Wagon/Sport Utility Vehicle | 152883 | |
| Taxi | 19015 | 3.4% |
| Pick-up Truck | 14931 | 2.7% |
| Box Truck | 12418 | 2.2% |
| Bike | 11773 | 2.1% |
| Bus | 7760 | 1.4% |
| Tractor Truck Diesel | 4375 | 0.8% |
| Van | 3275 | 0.6% |
| Motorcycle | 2363 | 0.4% |
| Other values (880) | 16578 | 3.0% |
| (Missing) | 125877 |
Length
| Value | Count | Frequency (%) |
| sedan | 185201 | |
| vehicle | 152922 | |
| utility | 152890 | |
| station | 152883 | |
| wagon/sport | 152883 | |
| truck | 33387 | 3.6% |
| taxi | 19016 | 2.0% |
| pick-up | 14932 | 1.6% |
| box | 12474 | 1.3% |
| bike | 11779 | 1.3% |
| Other values (562) | 46014 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
DateTime
Date
| Distinct | 1097 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| Minimum | 2018-01-01 00:00:00 |
|---|---|
| Maximum | 2021-01-01 00:00:00 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| 2018 | |
|---|---|
| 2019 | |
| 2020 | |
| 2021 | 257 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2019 |
|---|---|
| 2nd row | 2020 |
| 3rd row | 2020 |
| 4th row | 2020 |
| 5th row | 2020 |
Common Values
| Value | Count | Frequency (%) |
| 2018 | 231563 | |
| 2019 | 211485 | |
| 2020 | 112900 | |
| 2021 | 257 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2018 | 231563 | |
| 2019 | 211485 | |
| 2020 | 112900 | |
| 2021 | 257 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
Time
Date
| Distinct | 1440 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| Minimum | 1900-01-01 00:00:00 |
|---|---|
| Maximum | 1900-01-01 23:59:00 |
DayOfWeekNumber
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.905658885 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 79914 |
| Zeros (%) | 14.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.942335525 |
|---|---|
| Coefficient of variation (CV) | 0.6684664657 |
| Kurtosis | -1.184752032 |
| Mean | 2.905658885 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.03410297032 |
| Sum | 1616142 |
| Variance | 3.772667292 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 89013 | |
| 3 | 84092 | |
| 1 | 82483 | |
| 2 | 81105 | |
| 0 | 79914 | |
| 5 | 74467 | |
| 6 | 65131 |
| Value | Count | Frequency (%) |
| 0 | 79914 | |
| 1 | 82483 | |
| 2 | 81105 | |
| 3 | 84092 | |
| 4 | 89013 | |
| 5 | 74467 | |
| 6 | 65131 |
| Value | Count | Frequency (%) |
| 6 | 65131 | |
| 5 | 74467 | |
| 4 | 89013 | |
| 3 | 84092 | |
| 2 | 81105 | |
| 1 | 82483 | |
| 0 | 79914 |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| Friday | |
|---|---|
| Thursday | |
| Tuesday | |
| Wednesday | |
| Monday | |
| Other values (2) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.155897556 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Tuesday |
|---|---|
| 2nd row | Tuesday |
| 3rd row | Thursday |
| 4th row | Friday |
| 5th row | Wednesday |
Common Values
| Value | Count | Frequency (%) |
| Friday | 89013 | |
| Thursday | 84092 | |
| Tuesday | 82483 | |
| Wednesday | 81105 | |
| Monday | 79914 | |
| Saturday | 74467 | |
| Sunday | 65131 |
Length
Pie chart
| Value | Count | Frequency (%) |
| friday | 89013 | |
| thursday | 84092 | |
| tuesday | 82483 | |
| wednesday | 81105 | |
| monday | 79914 | |
| saturday | 74467 | |
| sunday | 65131 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
hourofday
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.14288077 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 20154 |
| Zeros (%) | 3.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 9 |
| median | 14 |
| Q3 | 17 |
| 95-th percentile | 22 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 5.760881449 |
|---|---|
| Coefficient of variation (CV) | 0.4383271483 |
| Kurtosis | -0.4082702086 |
| Mean | 13.14288077 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.4445238765 |
| Sum | 7310136 |
| Variance | 33.18775507 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 39694 | 7.1% |
| 17 | 39205 | 7.0% |
| 14 | 37817 | 6.8% |
| 15 | 35932 | 6.5% |
| 18 | 34207 | 6.2% |
| 13 | 32317 | 5.8% |
| 12 | 30785 | 5.5% |
| 8 | 30279 | 5.4% |
| 9 | 28945 | 5.2% |
| 11 | 28795 | 5.2% |
| Other values (14) | 218229 |
| Value | Count | Frequency (%) |
| 0 | 20154 | |
| 1 | 9229 | 1.7% |
| 2 | 6848 | 1.2% |
| 3 | 6121 | 1.1% |
| 4 | 6803 | 1.2% |
| 5 | 7773 | 1.4% |
| 6 | 13039 | |
| 7 | 18275 | |
| 8 | 30279 | |
| 9 | 28945 |
| Value | Count | Frequency (%) |
| 23 | 15109 | 2.7% |
| 22 | 17846 | |
| 21 | 19590 | |
| 20 | 22723 | |
| 19 | 27633 | |
| 18 | 34207 | |
| 17 | 39205 | |
| 16 | 39694 | |
| 15 | 35932 | |
| 14 | 37817 |
| Distinct | 60 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.26036623 |
| Minimum | 0 |
|---|---|
| Maximum | 59 |
| Zeros | 111720 |
| Zeros (%) | 20.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 7 |
| median | 25 |
| Q3 | 40 |
| 95-th percentile | 54 |
| Maximum | 59 |
| Range | 59 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 18.12438195 |
|---|---|
| Coefficient of variation (CV) | 0.7470778379 |
| Kurtosis | -1.238250639 |
| Mean | 24.26036623 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 0.09370039452 |
| Sum | 13493737 |
| Variance | 328.4932211 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 111720 | |
| 30 | 74354 | |
| 45 | 32750 | 5.9% |
| 15 | 31420 | 5.6% |
| 50 | 30419 | 5.5% |
| 20 | 29812 | 5.4% |
| 40 | 28182 | 5.1% |
| 10 | 22176 | 4.0% |
| 25 | 14883 | 2.7% |
| 35 | 14620 | 2.6% |
| Other values (50) | 165869 |
| Value | Count | Frequency (%) |
| 0 | 111720 | |
| 1 | 2483 | 0.4% |
| 2 | 2703 | 0.5% |
| 3 | 2777 | 0.5% |
| 4 | 2817 | 0.5% |
| 5 | 13674 | 2.5% |
| 6 | 2873 | 0.5% |
| 7 | 2839 | 0.5% |
| 8 | 3422 | 0.6% |
| 9 | 2805 | 0.5% |
| Value | Count | Frequency (%) |
| 59 | 2296 | 0.4% |
| 58 | 3402 | 0.6% |
| 57 | 2920 | 0.5% |
| 56 | 2795 | 0.5% |
| 55 | 13894 | |
| 54 | 2936 | 0.5% |
| 53 | 3036 | 0.5% |
| 52 | 2812 | 0.5% |
| 51 | 2447 | 0.4% |
| 50 | 30419 |
timeofdaypercent
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 1440 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.54722021 |
| Minimum | 0 |
|---|---|
| Maximum | 23.98333333 |
| Zeros | 8482 |
| Zeros (%) | 1.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.75 |
| Q1 | 9.583333333 |
| median | 14.25 |
| Q3 | 17.9 |
| 95-th percentile | 22.16666667 |
| Maximum | 23.98333333 |
| Range | 23.98333333 |
| Interquartile range (IQR) | 8.316666667 |
Descriptive statistics
| Standard deviation | 5.774132212 |
|---|---|
| Coefficient of variation (CV) | 0.4262226584 |
| Kurtosis | -0.3782311866 |
| Mean | 13.54722021 |
| Median Absolute Deviation (MAD) | 4.083333333 |
| Skewness | -0.4525256113 |
| Sum | 7535031.617 |
| Variance | 33.3406028 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 8482 | 1.5% |
| 16 | 7813 | 1.4% |
| 17 | 7558 | 1.4% |
| 15 | 7323 | 1.3% |
| 14 | 7230 | 1.3% |
| 18 | 6938 | 1.2% |
| 13 | 6655 | 1.2% |
| 12 | 6122 | 1.1% |
| 9 | 6021 | 1.1% |
| 8 | 5729 | 1.0% |
| Other values (1430) | 486334 |
| Value | Count | Frequency (%) |
| 0 | 8482 | |
| 0.01666666667 | 431 | 0.1% |
| 0.03333333333 | 130 | < 0.1% |
| 0.05 | 121 | < 0.1% |
| 0.06666666667 | 99 | < 0.1% |
| 0.08333333333 | 768 | 0.1% |
| 0.1 | 98 | < 0.1% |
| 0.1166666667 | 84 | < 0.1% |
| 0.1333333333 | 97 | < 0.1% |
| 0.15 | 99 | < 0.1% |
| Value | Count | Frequency (%) |
| 23.98333333 | 76 | < 0.1% |
| 23.96666667 | 91 | < 0.1% |
| 23.95 | 75 | < 0.1% |
| 23.93333333 | 84 | < 0.1% |
| 23.91666667 | 450 | |
| 23.9 | 80 | < 0.1% |
| 23.88333333 | 92 | < 0.1% |
| 23.86666667 | 83 | < 0.1% |
| 23.85 | 68 | < 0.1% |
| 23.83333333 | 958 |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| January | |
|---|---|
| October | |
| March | |
| June | |
| July | |
| Other values (7) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 6.164412402 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | May |
|---|---|
| 2nd row | January |
| 3rd row | December |
| 4th row | December |
| 5th row | April |
Common Values
| Value | Count | Frequency (%) |
| January | 49674 | |
| October | 48141 | |
| March | 48110 | |
| June | 47953 | |
| July | 47404 | |
| August | 46677 | |
| May | 46594 | |
| September | 46387 | |
| February | 45759 | |
| November | 45417 | |
| Other values (2) | 84089 |
Length
| Value | Count | Frequency (%) |
| january | 49674 | |
| october | 48141 | |
| march | 48110 | |
| june | 47953 | |
| july | 47404 | |
| august | 46677 | |
| may | 46594 | |
| september | 46387 | |
| february | 45759 | |
| november | 45417 | |
| Other values (2) | 84089 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 168 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 82.87869401 |
| Minimum | 0 |
|---|---|
| Maximum | 167 |
| Zeros | 2693 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 41 |
| median | 84 |
| Q3 | 120 |
| 95-th percentile | 159 |
| Maximum | 167 |
| Range | 167 |
| Interquartile range (IQR) | 79 |
Descriptive statistics
| Standard deviation | 46.93774189 |
|---|---|
| Coefficient of variation (CV) | 0.5663426826 |
| Kurtosis | -1.148587267 |
| Mean | 82.87869401 |
| Median Absolute Deviation (MAD) | 41 |
| Skewness | 0.02702035481 |
| Sum | 46097544 |
| Variance | 2203.151614 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 112 | 6490 | 1.2% |
| 113 | 6397 | 1.2% |
| 89 | 6259 | 1.1% |
| 41 | 6207 | 1.1% |
| 88 | 6206 | 1.1% |
| 40 | 6081 | 1.1% |
| 110 | 6067 | 1.1% |
| 65 | 6034 | 1.1% |
| 111 | 6021 | 1.1% |
| 16 | 5961 | 1.1% |
| Other values (158) | 494482 |
| Value | Count | Frequency (%) |
| 0 | 2693 | |
| 1 | 1081 | 0.2% |
| 2 | 746 | 0.1% |
| 3 | 616 | 0.1% |
| 4 | 713 | 0.1% |
| 5 | 1054 | 0.2% |
| 6 | 2200 | |
| 7 | 3140 | |
| 8 | 5086 | |
| 9 | 4806 |
| Value | Count | Frequency (%) |
| 167 | 1981 | |
| 166 | 2508 | |
| 165 | 2679 | |
| 164 | 3007 | |
| 163 | 3300 | |
| 162 | 3861 | |
| 161 | 4013 | |
| 160 | 4241 | |
| 159 | 4025 | |
| 158 | 4452 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | CRASH DATE | CRASH TIME | BOROUGH | LATITUDE | LONGITUDE | LOCATION | NUMBER OF PERSONS INJURED | NUMBER OF PERSONS KILLED | NUMBER OF PEDESTRIANS INJURED | NUMBER OF PEDESTRIANS KILLED | NUMBER OF CYCLIST INJURED | NUMBER OF CYCLIST KILLED | NUMBER OF MOTORIST INJURED | NUMBER OF MOTORIST KILLED | CONTRIBUTING FACTOR VEHICLE 1 | COLLISION_ID | VEHICLE TYPE CODE 1 | VEHICLE TYPE CODE 2 | DateTime | Year | Time | DayOfWeekNumber | DayOfWeek | hourofday | minute | timeofdaypercent | month | hourofweek | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 13 | 2019-05-21 | 22:50 | BROOKLYN | 40.697540 | -73.98312 | (40.69754, -73.98312) | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Passing or Lane Usage Improper | 4136992 | �MBU | Taxi | 2019-05-21 | 2019 | 1900-01-01 22:50:00 | 1 | Tuesday | 22 | 50 | 22.833333 | May | 46 |
| 1 | 14 | 2020-01-21 | 15:49 | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inattention/Distraction | 4277087 | Sedan | Station Wagon/Sport Utility Vehicle | 2020-01-21 | 2020 | 1900-01-01 15:49:00 | 1 | Tuesday | 15 | 49 | 15.816667 | January | 39 |
| 2 | 39 | 2020-12-31 | 16:30 | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Reaction to Uninvolved Vehicle | 4380668 | Sedan | NaN | 2020-12-31 | 2020 | 1900-01-01 16:30:00 | 3 | Thursday | 16 | 30 | 16.500000 | December | 88 |
| 3 | 93 | 2020-12-25 | 20:19 | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Following Too Closely | 4380494 | Sedan | Sedan | 2020-12-25 | 2020 | 1900-01-01 20:19:00 | 4 | Friday | 20 | 19 | 20.316667 | December | 116 |
| 4 | 423 | 2020-04-15 | 15:20 | NaN | 40.671585 | -73.99843 | (40.671585, -73.99843) | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inattention/Distraction | 4407790 | Sedan | Tractor Truck Diesel | 2020-04-15 | 2020 | 1900-01-01 15:20:00 | 2 | Wednesday | 15 | 20 | 15.333333 | April | 63 |
| 5 | 662 | 2020-10-25 | 2:00 | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inattention/Distraction | 4360880 | Sedan | NaN | 2020-10-25 | 2020 | 1900-01-01 02:00:00 | 6 | Sunday | 2 | 0 | 2.000000 | October | 146 |
| 6 | 690 | 2020-11-11 | 16:33 | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | 4387870 | Station Wagon/Sport Utility Vehicle | NaN | 2020-11-11 | 2020 | 1900-01-01 16:33:00 | 2 | Wednesday | 16 | 33 | 16.550000 | November | 64 |
| 7 | 847 | 2019-04-17 | 0:49 | NaN | 40.651974 | -73.86542 | (40.651974, -73.86542) | 3.0 | 0.0 | 0 | 0 | 0 | 0 | 3 | 0 | Following Too Closely | 4408571 | Station Wagon/Sport Utility Vehicle | Sedan | 2019-04-17 | 2019 | 1900-01-01 00:49:00 | 2 | Wednesday | 0 | 49 | 0.816667 | April | 48 |
| 8 | 1044 | 2020-04-17 | 1:50 | MANHATTAN | 40.771610 | -73.99046 | (40.77161, -73.99046) | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Driver Inattention/Distraction | 4408441 | Sedan | NaN | 2020-04-17 | 2020 | 1900-01-01 01:50:00 | 4 | Friday | 1 | 50 | 1.833333 | April | 97 |
| 9 | 1145 | 2020-12-18 | 7:00 | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Pavement Slippery | 4377115 | Station Wagon/Sport Utility Vehicle | NaN | 2020-12-18 | 2020 | 1900-01-01 07:00:00 | 4 | Friday | 7 | 0 | 7.000000 | December | 103 |
Last rows
| df_index | CRASH DATE | CRASH TIME | BOROUGH | LATITUDE | LONGITUDE | LOCATION | NUMBER OF PERSONS INJURED | NUMBER OF PERSONS KILLED | NUMBER OF PEDESTRIANS INJURED | NUMBER OF PEDESTRIANS KILLED | NUMBER OF CYCLIST INJURED | NUMBER OF CYCLIST KILLED | NUMBER OF MOTORIST INJURED | NUMBER OF MOTORIST KILLED | CONTRIBUTING FACTOR VEHICLE 1 | COLLISION_ID | VEHICLE TYPE CODE 1 | VEHICLE TYPE CODE 2 | DateTime | Year | Time | DayOfWeekNumber | DayOfWeek | hourofday | minute | timeofdaypercent | month | hourofweek | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 556195 | 688748 | 2018-01-11 | 17:29 | BROOKLYN | 40.615944 | -73.975640 | (40.615944, -73.97564) | 1.0 | 0.0 | 0 | 0 | 1 | 0 | 0 | 0 | Driver Inattention/Distraction | 3828035 | Sedan | Bike | 2018-01-11 | 2018 | 1900-01-01 17:29:00 | 3 | Thursday | 17 | 29 | 17.483333 | January | 89 |
| 556196 | 688749 | 2018-01-21 | 18:27 | NaN | 40.741660 | -73.735886 | (40.74166, -73.735886) | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Passing or Lane Usage Improper | 3832823 | Sedan | Station Wagon/Sport Utility Vehicle | 2018-01-21 | 2018 | 1900-01-01 18:27:00 | 6 | Sunday | 18 | 27 | 18.450000 | January | 162 |
| 556197 | 688750 | 2018-01-19 | 18:30 | NaN | 40.738487 | -73.989960 | (40.738487, -73.98996) | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Passing or Lane Usage Improper | 3831311 | Pick-up Truck | Station Wagon/Sport Utility Vehicle | 2018-01-19 | 2018 | 1900-01-01 18:30:00 | 4 | Friday | 18 | 30 | 18.500000 | January | 114 |
| 556198 | 688751 | 2018-01-05 | 11:00 | NaN | 40.791435 | -73.850440 | (40.791435, -73.85044) | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Passing Too Closely | 3823209 | Sedan | NaN | 2018-01-05 | 2018 | 1900-01-01 11:00:00 | 4 | Friday | 11 | 0 | 11.000000 | January | 107 |
| 556199 | 688752 | 2018-01-13 | 17:30 | QUEENS | 40.713820 | -73.920770 | (40.71382, -73.92077) | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Pavement Slippery | 3828610 | Station Wagon/Sport Utility Vehicle | Sedan | 2018-01-13 | 2018 | 1900-01-01 17:30:00 | 5 | Saturday | 17 | 30 | 17.500000 | January | 137 |
| 556200 | 688753 | 2018-01-26 | 22:36 | BROOKLYN | 40.646640 | -73.924600 | (40.64664, -73.9246) | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | 3837602 | Sedan | Station Wagon/Sport Utility Vehicle | 2018-01-26 | 2018 | 1900-01-01 22:36:00 | 4 | Friday | 22 | 36 | 22.600000 | January | 118 |
| 556201 | 688754 | 2018-01-14 | 15:00 | NaN | 40.677483 | -73.930330 | (40.677483, -73.93033) | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Driver Inattention/Distraction | 3830100 | Station Wagon/Sport Utility Vehicle | Sedan | 2018-01-14 | 2018 | 1900-01-01 15:00:00 | 6 | Sunday | 15 | 0 | 15.000000 | January | 159 |
| 556202 | 688755 | 2018-01-14 | 12:20 | MANHATTAN | 40.867580 | -73.918420 | (40.86758, -73.91842) | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Driver Inexperience | 3830852 | Sedan | Station Wagon/Sport Utility Vehicle | 2018-01-14 | 2018 | 1900-01-01 12:20:00 | 6 | Sunday | 12 | 20 | 12.333333 | January | 156 |
| 556203 | 688756 | 2018-01-23 | 6:30 | NaN | 40.832764 | -73.945830 | (40.832764, -73.94583) | 1.0 | 0.0 | 1 | 0 | 0 | 0 | 0 | 0 | Failure to Yield Right-of-Way | 3834124 | Taxi | NaN | 2018-01-23 | 2018 | 1900-01-01 06:30:00 | 1 | Tuesday | 6 | 30 | 6.500000 | January | 30 |
| 556204 | 688757 | 2018-01-17 | 5:49 | BROOKLYN | 40.725880 | -73.941696 | (40.72588, -73.941696) | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Failure to Yield Right-of-Way | 3829741 | Sedan | Sedan | 2018-01-17 | 2018 | 1900-01-01 05:49:00 | 2 | Wednesday | 5 | 49 | 5.816667 | January | 53 |